Speech Enhancement Using Principal Component Analysis and Variance of the Reconstruction Error Model Identification

نویسندگان

  • Amin Haji Abolhassani
  • Douglas O'Shaughnessy
  • Jacob Benesty
  • Peter Kabal
چکیده

In recent years, Automatic Speech Recognition (ASR) systems designed to work in controlled environments using clean speech have reached very high levels of performance. However, the accuracy of speech recognition degrades severely when the systems are operated in noisy environments. In this thesis we address the problem of single-channel speech enhancement. Starting with a study of the state-of-the-art enhancement methods, a comprehensive study of different categories of speech enhancement is presented. As an important class of speech enhancement methods, subspace-based speech enhancement is presented in chapter 2. After a careful study of all forces and drawbacks of this technique, a generalized form of Principal Component Analysis-based (PCA-based) speech enhancement is provided next. As a vital issue in PCA-based enhancement methods, identification of the clean speech signal's model is investigated in chapter 3. Some recent techniques to define the rank of a clean speech signal are presented in this chapter. In the rest of the thesis, a novel technique for rank estimation is developed. We introduce therefore a novel approach for the optimal subspace partitioning using the Variance of the Reconstruction Error (VRE) criterion. This criterion provides consistent parameter estimates and allows us to implement an automatic noise reduction algorithm that can be simply applied to the observed data. This choice also overcomes many limitations encountered with other selection criteria, like overestimation of the signal subspace or the need for empirical parameters. We have also extended our subspace algorithm to take into account the case of colored and babble noise. Informal listening tests and illustrations have confirmed the method to be numerically noise robust regardless of the type of the noise. ii Acknowledgements

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech enhancement using PCA and variance of the reconstruction error model identification

We present in this paper a subspace approach for enhancing a noisy speech signal. The original algorithm for model identification from which we have derived our method has been used in the field of fault detection and diagnosis. This algorithm is based on principal component analysis in which the optimal subspace selection is provided by a variance of the reconstruction error (VRE) criterion. T...

متن کامل

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...

متن کامل

Speech enhancement based on hidden Markov model using sparse code shrinkage

This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...

متن کامل

Speech Enhancement using Adaptive Data-Based Dictionary Learning

In this paper, a speech enhancement method based on sparse representation of data frames has been presented. Speech enhancement is one of the most applicable areas in different signal processing fields. The objective of a speech enhancement system is improvement of either intelligibility or quality of the speech signals. This process is carried out using the speech signal processing techniques ...

متن کامل

A Novel Frequency Domain Linearly Constrained Minimum Variance Filter for Speech Enhancement

A reliable speech enhancement method is important for speech applications as a pre-processing step to improve their overall performance. In this paper, we propose a novel frequency domain method for single channel speech enhancement. Conventional frequency domain methods usually neglect the correlation between neighboring time-frequency components of the signals. In the proposed method, we take...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008